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ABSTRACT 

The problem of optimal node density for ad hoc sensor networks deployed 
for making inferences about two dimensional correlated random fields is 
considered. Using a symmetric first order conditional autoregressive Gauss- 
Markov random field model, large deviations results are used to character- 
ize the asymptotic per-node information gained from the aiTay. This result 
then allows an analysis of the node density that maximizes the information 
under an energy constraint, yielding insights into the trade-offs among the 
information, density and energy. 

1. INTRODUCTION 

We consider the design of wireless ad hoc sensor networks for 
making inferences about correlated random fields that can model 
various physical processes, such as temperature, humidity or the 
density of a certain gas, in a two-dimensional (2-D) space. In par- 
ticular, we consider the optimal density problem for sensor net- 
works deployed for statistical inference such as detection or recon- 
struction of the underlying field. From the information-theoretic 
perspective, statistical inference via sensor networks can be viewed 
as a problem of extracting information about an underlying phys- 
ical process using networked sensor nodes that consume energy 
for both sensing and communication. Thus, the optimal density 
problem can be formulated as follows. 

Problem 1 Given a sensor network deployed on a fixed coverage 
area of size 2L x 2L and with total available energy E, find the 
node density fin that maximizes the total information It obtainable 
from the network. 

To address this problem, we model the signal field as a 2-D Gauss- 
Markov random field (GMRF), and consider the Kullback-Leibler 
information (KLI) and mutual information (MI) (T| as ways of 
quantifying inferential performance. (The operational meaning of 
the KLI is given by its appearance as the error exponent of the 
miss probability of Neyman-Pearson detection of the signal field 
in sensor noise, whereas that of the MI is given by its role as a 
measure of uncertainty reduction.) Our approach to determine the 
total information obtainable from a sensor network is based on 
the large deviations principle (LDP). That is, for large networks, 
the total information is approximately given by the product of the 
number of sensors and the asymptotic per-node information, or the 
asymptotic information rate. (The units of these intensive quanti- 
ties is thus nats/sample.) Although closed-form expressions for the 
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asymptotic per-node information are not available for general 2-D 
signals, for the conditional autoregression (CAR) model closed- 
form expressions for the asymptotic KLI and MI rates have been 
determined by the authors in (2|- Based on these expressions for 
asymptotic information rates and their properties, in the current 
paper we investigate the problem of optimal node density. It is 
seen that there exists a density maximizing the total information 
obtainable under an energy constraint. The optimal density is eas- 
ily obtained numerically, and the behavior of the total information 
as a function of the density is explained. 

1.1. Related Work 

The issues of optimal sensor density and optimal sampling have 
been considered based on LDP in previous work (e.g., O). How- 
ever, most work in this area is based on one-dimensional (1-D) sig- 
nal or time series models that do not capture the two-dimensionality 
of actual spatial signals. In contrast, our work is based on the 
LDP results obtained in (2, where a closed-form expression for 
the asymptotic KLI rate is obtained in the spectral domain. For a 
2-D setting, an error exponent was obtained for the detection of 
2-D GMRFs in (2, where the sensors are located randomly and 
the Markov graph is based on the nearest neighbor dependency 
enabling a loop-free graph. In that work, however, measurement 
noise was not considered, unlike the present analysis. 

2. SIGNAL MODEL AND BACKGROUND 

In this section, we briefly introduce our previous work (2) relevant 
to the sensor density problem. To simplify the problem and gain 
insight into the 2-D case, we assume that sensors are located on a 
2-D lattice I„ = [— n : 1 : n]^, as shown in Fig. [T] and thus form 
a 2-D array. We model the underlying physical process as a 2-D 
GMRF and assume that each sensor has Gaussian measurement 
noise. So, the observation Yij of Sensor ij on the 2-D lattice X„ is 
given by 

Y,j +w^,j, ij ex„, (1) 

where {Wij} represents independent and identically distributed 
(i.i.d.) zero-mean Gaussian measurement noise with variance cr^, 
and {Xij^ is a GMRF on X„, independent of {VFij}. Note that 
the observation samples form a 2-D hidden GMRF on X„ • In the 
following, we summarize our relevant LDP results on GMRFs that 
will be useful in the sequel. 




Fig. 1. 2-D sensor array on a lattice T„: Hidden Markov structure 



Definition 1 (GMRF g)) A random vector X = (Xi, X2, • • • , 
X„ ) G R" is a Gauss-Markov random field with respect to (w.r. t. ). 
a labelled graph Q — {v,£) with mean vector fi and precision 
matrix Q > 0, ;/ its probability density fimction is given by 



p(X) = (2^)-"/2|Q| 
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and Qim 7^ {I, m} G Sfor all I 7^ m. Here, v is the set of 
all nodes {1, 2, ■ ■ ■ , n} and £ is the set of edges connecting pairs 
of nodes, which represent the conditional dependence structure. 

Note that the 2-D indexing scheme ij in lUll can be properly con- 
verted to an 1-D scheme to apply Definition [T] From here on, we 
use the 2-D indexing scheme for convenience. 

Definition 2 (The Conditional Autoregression (CAR)) A GMRF 
{Xij } is said to be a conditional autoregression if it is specified 
using a set of full conditional normal distributions with means and 
precisions: 



E{Xy|X_y} 



1 

"(ho 



eoo > 0, (4) 



Prec{Xy|X_y} 
where 'K.-ij denotes the set of all variables except Xij. 

By imposing first order symmetry on the correlation structure, we 
have the symmetric first order conditional autoregression (SFCAR) 
defined by the conditions 

E{Xij|X_ij} — — (Xi+ij + Jfi_i_j + Xij+i + 
Prec{Xij|X_ij} = 



K > 0, 



where < A < -1. Here, 9oo = and 61,0 = = Oo.i = 

^0,-1 ~ —A. The SFCAR model is the 2-D extension of the 1- 
D autoregressive (AR) model that is widely used to model basic 
correlation in 1-D. Here in the 2-D case we have symmetric con- 
ditional dependence on four neighboring nodes in the four (pla- 
nar) directions, capturing basic 2-D correlation structure. It can be 
shown that the GMRF defined by the SFCAR model is a zero-mean 
stationary Gaussian process on Too with power spectral density 151 
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where the edge dependence factor C, is defined as 



C = -, o<c<i/4. 



(5) 



(6) 



The SFCAR model is useful especially because the correlation 
strength is captured in this single quantity C, for SFCAR signals, 
which enables us to investigate the per-node information as a func- 
tion of the field correlation. Here, C = corresponds to the i.i.d. 
case, whereas = 1/4 corresponds to the perfectly correlated 
case. Henceforth, we assume that the 2-D stochastic signal {Xij} 
in (UJ is given by a stationary GMRF defined by the SFCAR model, 

as n ^ 00. The signal power P = E{Xoo}^ (= E{X|, } V 
is obtained using the inverse Fourier transform, and is given by 
P = ^^k'^' . (0 < C <\)^ where K{-) is the complete elliptic 
integral of the first kind |3- Thus, the measurement SNR is given 
bySNR=4 = 2il(iC). 



2.1. Large System Analysis: Per-Node Information 

The key idea behind the large system analysis here is that, un- 
der the stationarity assumption, the amounts of information from 
the node become identical regardless of sensor location as the net- 
work size grows, and the total amount of information is given ap- 
proximately by the product of the number of sensor nodes and the 
(asymptotic) per-node information. The asymptotic per-node KLI 
and per-node MI are defined as 

^ ■ log — ({Kij , ij e X„}) a.s. under po, and 



%s = lim ,^ 

1 



Pi 



lim y^I{{Xij,ij eI„};{Yij,ij ein}), 

respectively. For the MI, the signal model (T} is applicable di- 
rectly, whereas for the KLI the probability density functions of the 
null (noise-only) and alternative (signal-plus-noise) distributions 
are those given under the respective models 



PoiYij) : Yij = Wij, ij e I„, 
PiiYij) : Yij = Xij + Wij, ij G X„ 



(7) 



The following closed-form expressions for the asymptotic per-node 
information in the spectral domain have been obtained in (2| by ex- 
ploiting the spectral structure of the CAR signal and the relation- 
ship between the eigenvalues of block circulant and block Toeplitz 
matrices representing 2-D congelation structure. 

Tfieorem 1 Under the 2-D SFCAR signal model, the asymptotic 
per-node KLI %s and per-node MI 3 s are given by 



2/,r)J<(4C)(l - 2C cos 1^1 - 2C cos 1^2) 
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respectively. 

Note that the SNR and correlation are separated in l[8]l-l[9]l, which 
enables us to investigate the effects of each term separately. With 
regard to %s and 3s as functions of it is readily seen from Theo- 
rem[T]that 3Cs and 3s are continuously differentiable functions 
of the edge dependence factor C (0 < C ^ 1/4) for a given SNR 
since f : x ^ K{x) is a continuously differentiable C°° function 
for < x < 1 Q. Fig. |2]shows %s as a function of for several 
different SNR values. It is seen in the figure that at high SNR 'Xs 
decreases monotonically as the correlation becomes strong, i.e., 
^ — > 1 /4. At low SNR, on the other hand, correlation is beneficial 
to the performance. Js shows similar behaviors even if it is not 
shown here. 

With regard to %s and Is as functions of SNR, the behavior is 
given by the following theorem from |[2l. 

Tlieorem 2 %s and '3a are continuous and monotonically increas- 
ing as SNR increases for a given edge dependence factor < 
C, < 1/4. Moreover, JCs and 3s increase linearly with respect 
to i log SNR as SNR oo. As SNR decreases to zero, on the 
other hand, Xs and 3s decrease to zero with convergence rates 

Xs{SNR) = c- SNR^ + o(SNR^), (10) 

3s{SNR) = c ■SNR + oi.SNR), (11) 

respectively, for some constants c and c'. 
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Fig. 2. Xs as a function of (: (a) SNR = 10 dB, (b) SNR = dB, 
(c) SNR = -3 dB, (d) SNR = -5 dB (from (2)) 



3. AD HOC SENSOR NETWORKS: OPTIMAL DENSITY 

Based on the results in the previous sections, we now address the 
optimal density problem given in Section[T] 



3.1. Physical correlation model 

As we vary the node density for a given area with size 2L x 2L, 
the sensor spacing d„ changes. In turn, the edge dependence factor 
between two adjacent samples varies for given physical diffusion 
parameters. So, we first derive the relationship between sensor 
spacing d„ and the edge dependence factor C, for the SFCAR. The 
physical correlation for the SFCAR model is obtained by solv- 
ing the continuous-index equivalent given by the 2-D stochastic 
Laplace equation [8) 



d ^ ^ 
dx . 



X{x,y) = u{x,y), 



(12) 



where u{x, y) is the 2-D white zero-mean Gaussian perturbation 
and a > is the diffusion rate. By solving this equation, the edge 
correlation factor p is given, as a function of the sensor spacing 
d„, by (U 



P = 



A E{XooXio} 
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where Ki{-) is the modified Bessel function of the second kind. 
The correlation function l ll3t can be regarded as the representative 
correlation in 2-D, similar to the exponential correlation function 
g-Ad„ gQfi^ functions decrease monotonically w.r.t. d„. 

However, the 2-D correlation function is flat at ii„ = |[8j. Fur- 
ther, we have a mapping g : p ^ ( from the edge correlation 
factor p to the edge dependence factor C,, given by (9| 



(2A)A-(4C) ~ 1 
4(2/^)C7^(4C) 



(14) 



which maps zero and one to zero and 1/4, respectively. Combining 
^ and dl, we have a mapping ( = h{g{d„)) from the sensor 
spacing dn to for the SFCAR model. 



3.2. Density Analysis 

The assumptions for the planar ad hoc sensor network that we con- 
sider is summarized in the following. 

(A.l) (2?! + 1)^ sensors are located on the lattice Jn = [~n : 
1 : n]^ with spacing dn, as shown in Fig. [T] and a fusion 
center is located at the center (0, 0). The observation sam- 
ples {Yij} at sensors form a 2-D hidden SFCAR GMRF on 
the lattice, and the correlation functions are given by l |13t - 
(EJ. 

(A.2) The fusion center collects the measurement from all nodes 
using minimum hop routing. A hop count of |i| + \j\ is re- 
quired for minimum hop routing to deliver Yi j to the fusion 
center. 

(A.3) The communication energy per edge is given by Ec{d„) = 
Eod^, where v > 2 is the attenuation factor of wireless 
propagation in the physical layer. 

(A.4) Sensing requires energy, and the sensing energy per node is 
denoted by Es. Further, we assume that the measurement 
SNR increases linearly w.r.t. Es, i.e., SNR — /3Eb for 
some constant /3Q 

The density optimization under the energy constraint can be solved 
using our large system analysis in the previous sections assuming 
the asymptotic result is still valid in low density case. The total 
amount It of information is given by 

It = (2n + 1)^X4 SNR, d„) or h = (2n + l)^^, (SNR, <i„), (15) 

for KLl or Ml, respectively. The total energy E required for data 
collection is given by 



{2n + if Es + E^{dn) 
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(2n + ifEs + 2n(n + l)(2n + l)Ec{d„). 



(16) 



Thus, Problem 1 can be reformulated as 



= argmax (2L)^^i„3Cs (SNR(£;, p„ ) , d„ (^i„ )) , (17) 
s.t. (2n + ifEsifJ-n) + 2n(?i + l)(2n + l)E^{d„{fi„)) < E, 

where the sensing energy Es as well as n and dn are functions 
of the node density pn- From /i„ (= (2n + 1)^/(2L)^), we first 
calculate n and then dn = L/n. When dn is determined, Ec{d„) 
is obtained from the propagation parameters Eo and f, and then 
Es{pn) is obtained from the constraint in l ll7t . Once Es{pn) is 
determined, the measurement SNR is calculated using Assumption 
(A.5), i.e., SNR = /SEs and finally we evaluate the per-sensor in- 
formation 3<:s(SNR, C(p(dn))) and J^SNR, C(p(rfn))) from The- 
orem [T] 

Fig. |3] shows the total information obtainable from 2x2 
square meter area as we vary the node density pn with a fixed 
total energy budget of E joules. Other parameters that we use are 
given by 

a = 100 (diffusion rate), /3 = 1, -Bo = 0.1 and = 2. 



' Suppose that Ei joules are required for one sensing to obtain one 
sample Yij (m) = Xij{'m) + Wij (m) at sensor ij and the measurement 
SNR of this sample is SNRi. Now assume that we obtain A/ samples 
(m = 1, • ■ ■ , M) using M subsensors at the same location ij simultane- 
ously, requiring M ■ E\ joules, and we take an average of these M samples 
at sensor ij, yielding an effective sample Yij = (1/M) Y2m Yij{m) of 
SNR of MSNRi assuming that the measurement noise is i.i.d. across the 
subsensors. 
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Fig. 3. (a) total KLI vs. density and (b) total MI vs. density 



Here, the values of E, Eq and j3 are chosen so that the minimum 
and maximum per-sensor sensing SNR's are roughly -10 to 10 dB 
for maximum and minimum densities, respectively. The diffusion 
rate a — 100 is selected for the edge congelation coefficient p to 
vary from almost zero to 0.6 as the node density changes. It is seen 
in the figure that there is an optimal density for each value of E 
for both information measures. It is also seen that the total KLI is 
sensitive to the density change whereas the total MI is less sensi- 
tive. The existence of the optimal density is explained as follows. 
At low density, we have only a few sensors in the area. So, the 
energy for communication is not large due to the small number of 
communicating nodes and most of the energy is allocated to the 
sensing energy; the per-node sensing energy is even higher due to 
the small number of sensors. However, the per-node information 
increases only logarithmically w.r.t. the sensing energy or SNR 
by Theorem |2l and this logarithmic gain cannot compensate for 
the loss in the number of sensors. Therefore, low density yields 
very poor performance, and large gain is obtained initially as we 
increase the density from very low values as seen in Fig. [3] As we 
further increase the density, on the other hand, and the per-node 
sensing energy or SNR decreases due to the increase in the over- 
all communication and the increase in the number of sensor nodes, 
and the measurement SNR is eventually at low SNR regime, where 
l llOt and hold. From l ll6t . we have 

Es{ti„)=r^ SNR ^Oin-^) (18) 

for fixed E and Ec = Eo{L/nf, as n — > oo. By the low SNR 
behavior of %s given by l llOt . the behavior of the total Kullback- 
Leibler information is given by 

Total KLI = (2i) VnJCs = 0(n^l"'*) = 0{n'^) = 0(Atn ^) 

and by i ll It the total mutual information is given by 

Total MI = {2Lf finds = 0(n^n"^) = 0(1). 

This explains the initial decay after the peak in Fig.[3](a) and quite 
flat curve in Fig.[3](b). In the above equations, however, the effect 
of ( on OCs and "Js is not considered. As the node density increases, 
the sensor spacing decreases and the edge dependence factor C, in- 
creases for a given diffusion rate a. The behavior of the per-node 
information as a function of ^ is shown in Fig. |2] Note in Fig. [2] 
that the per-node information has a second lobe at strong correla- 
tion at low SNR while at high SNR it decreases monotonically as 
the correlation becomes strong. The benefit of sample correlation 
is evident in the low energy case {E — 50 [J]) in [3] (a); the sec- 
ond peak around fj,n ~ 95 [nodes/m^] is observed. Note that the 
second peak is not so significant. Since the per-node information 



decays to zero as — > 1/4 eventually, the total amount of informa- 
tion decreases eventually, as seen in the right corner of the figure, 
as we increases the node density. 

4. CONCLUSIONS 

We have considered the design of 2-D arrays of networked sensors 
for making inferences about 2-D correlated random fields. Under 
the SFCAR GMRF model, the density maximizing the total in- 
formation obtainable from the network under an energy constraint 
has been investigated. We have seen that such an optimal den- 
sity exists. At low density, the amount of information gathered is 
small because the logarithmic increase in the per-node informa- 
tion w.r.t. energy cannot compensate for the loss in the number of 
sensor nodes. At high density, on the other hand, the performance 
degrades mainly due to too much correlation between samples and 
low sensing energy. The optimal node density effects a trade-off 
between these two effects. 
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